An Initial Quality Analysis of the Ohloh Software Evolution Data

Author

  • Magiel Bruntink
Abstract

Large public data sets on software evolution promise great value to both researchers and practitioners, in particular for software (development) analytics. To realise this value, the data quality of such data sets needs to be studied and improved. Despite these data sets being of a secondary nature, i.e., they were not collected by the people using them, data quality is often taken for granted, casting doubt on conclusions drawn from those data. This paper reports on an initial investigation of the quality of the software evolution data available on Ohloh, and further describes steps taken to cleanse the data set. Our goal is that other researchers, practitioners, and parties responsible for data sets such as Ohloh use the outcomes of the validation and cleansing steps to improve the quality of data sets in the public domain.

Similar resources

Discovering Determinants of Project Participation in an Open Source Social Network

Successful open source software projects often require a steady supply of self-motivated software developers. However, little work has been done from a relational/network perspective to study the factors that drive the developers to participate in OSS projects. In this paper, we investigate the participation dynamics in a social network, particularly in an online open source community called Oh...

Towards Base Rates in Software Analytics

Nowadays a vast and growing body of open source software (OSS) project data is publicly available on the internet. Despite this public body of project data, the field of software analytics has not yet settled on a solid quantitative base for basic properties such as code size, growth, team size, activity, and project failure. What is missing is a quantification of the base rates of such propert...

Existence of Mild Solutions to a Cauchy Problem Presented by Fractional Evolution Equation with an Integral Initial Condition

In this article, we apply two new fixed point theorems to investigate the existence of mild solutions for a nonlocal fractional Cauchy problem with an integral initial condition in Banach spaces.

Investigating Relationships Between FLOSS Foundations and FLOSS Projects

Foundations function as vital institutional support infrastructures for many of the most successful open source projects, but the role of these support entities remains an understudied phenomenon in FLOSS research. Drawing on Open Hub (formerly known as Ohloh) data, this paper empirically investigates the different ways these entities support projects and interact with different projects and wi...

Effective Strategies for Optimal Implementation of Evolution and Innovation Packages in Medical Education

ABSTRACT BACKGROUND AND OBJECTIVE: Evolution and innovation packages in medical science education are the main program of medical education and it is necessary to pay attention to the provision of infrastructure of their implementation. This study was conducted to identify effective strategies for optimal implementation of evolution and innovation packages in medical education. METHODS: The met...

Journal:
  • ECEASST

Volume 65

Publication date: 2014